Model-Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming